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Abstract 

We solve the problem of minimizing the number of critical points among all 
functions on a surface within a prescribed distance S from a given input function. 
The result is achieved by establishing a connection between discrete Morse theory 
and persistent homology. Our method completely removes homological noise with 
persistence less than 26, constructively proving the tightness of a lower bound on 
the number of critical points given by the stability theorem of persistent homology 
in dimension two for any input function. We also show that an optimal solution can 
be computed in linear time after persistence pairs have been computed. 

1 Introduction 

Measured data and functions constructed from measured data suffer from omnipresent 
noise introduced during the measuring process. Separating relevant information from 
noise is therefore a widely considered problem. 

Taking a topological point of view, we regard noise as a source of critical points. 
Indeed, even arbitrarily small amounts of noise (with respect to the supremum norm) 
may give rise to an arbitrarily large number of critical points. We may hence interpret 
critical points that can be eliminated by small perturbations as being caused by noise. 
Consequently, we consider the following optimization problem: 

Problem (Topological simplification on surfaces). Given a function f on a surface and 
a real number 6 > 0, find a function f$ subject to \\fg — < 5 such that f$ has a 
minimum number of critical points. 

The class of functions and the notion of critical points we work with will be clarified 
later; for now, we just want to mention that multiple saddles (such as a "monkey saddle") 
are counted here with multiplicity. 

The Bottleneck Stability Theorem [6], a fundamental result in the theory of persistent 
homology [10, 34], provides a lower bound on the number of critical points: 

Proposition (Stability Bound). For any function f$ with \\f$ — /||oo < 5, the number of 
critical points of f$ is bounded from below by the number of critical points of f that 
have persistence > 26. 
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Clearly the question about the tightness of this bound is of great importance for 
the significance of the Bottleneck Stability Theorem. In the present article, we show 
constructively that the bound given by the stability theorem is actually tight for functions 
on surfaces (see Theorem 16): 

Theorem (Tightness of the stability bound). Given a function f on a surface and a real 
number 8 > 0, there exists a function f s such that \\f s — < 5 and the number of 
critical points of fs equals the number of critical points of f that have persistence > 25. 

A similar statement does not hold in higher dimensions or for non-manifold 2- 
complexes, see Section 6.5. 

1.1 Overview 

Discrete Morse theory [14, 15] provides equivalents of several core concepts of classical 
Morse theory, like discrete Morse functions, discrete gradient vector fields, critical 
points, and a cancelation theorem for the elimination of critical points from a vector 
field. Because of its simplicity, it not only maintains the intuition of the classical theory 
but allows to go beyond it by providing explicit constructions that would become quite 
complicated in the smooth setting. 

Persistent homology [10, 34] quantifies topological features of a function. It defines 
the birth and death of homology groups at critical points, identifies pairs of these 
(persistence pairs), and provides a measure of their significance {persistence). 

Whereas (discrete) Morse theory makes statements about the homotopy type of the 
sublevel sets of a function, persistence theory is concerned with their homology. Our 
solution to the problem of topological simplification on surfaces relies on a combination 
of both theories. In particular, we make contributions to the following problems: 

Canceling a single pair of critical points of a function Forman [14] describes a 
simple method for eliminating pairs of critical points in discrete vector fields . Modifying 
a function according to the cancelation of a pair of critical points, however, is more 
difficult and requires additional effort. We first observe that a discrete gradient vector 
field induces a partial order on the cells of the underlying complex, giving rise to 
the notion of attracting and repelling sets (in analogy to the notion of stable and 
unstable manifolds in the classical theory). Building on these concepts, we describe a 
canonical method for eliminating a pair of critical points of a discrete Morse function. 
This complements Forman's cancelation method for discrete gradient vector fields. In 
particular, it is applicable in any dimension (Section 4.1). An informal description in 
dimension 1 is shown in Figure 1 . 

To cancel a pair of critical points whose values differ by d, our method modifies the 
function by ^ in the supremum norm, which is the minimum required for cancelation 
(see Figure 1). To achieve this minimum, elimination of critical points has to take into 
account the attracting and repelling sets of the canceled pair, containing cells of all 
dimensions; moreover, an arbitrary number of other critical points might have their 
value changed. This is in contrast to previous related methods [12, 3] which operate 
just on the 1 -skeleton of the surface (and on the 1 -skeleton of its dual) and does not 
affect other critical points, but modifies the function by d and hence is not a minimal 
modification in the supremum norm. Moreover, these methods do not extend to higher 
dimensions. 
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Figure 1: Cancelation of critical points, (a) shows the graph of a function together with 
the directions of its gradient vector field. The values of the repelling set (b) of the upper 
critical point (marked with f) and of the attracting set (c) of the lower critical point 
(marked with *) are cut off at the average value of the two critical points, creating a 
plateau (d). The old gradient directions are still consistent with the new function. The 
gradient vector field can now be reversed along the path between the critical points, 
eliminating the pair (e). The resulting function has a plateau, but can be perturbed 
slightly to become non-degenerate (f). 



Degenerate functions Morse theory, in any of its variations, fundamentally relies on 
the assumption that the critical points of the function considered are non-degenerate. 
This condition not only prevents the theory from being applicable directly to arbitrary 
input functions. The canonical function arising from canceling a pair of critical points 
has a plateau (see Figure 1) and hence is not a discrete Morse function. However, there 
is a Morse function arbitrarily close to it. This necessitates a method to deal with such 
degenerate functions. To do so, we devise a symbolic perturbation scheme (Section 2.2) 
based on discrete gradient vector fields, allowing to treat the degenerate case in the 
same way as the generic case by considering the larger class of pseudo-Morse functions; 
in particular, we do not require the input function to be generic. Instead of deriving 
information about critical points from the function directly (which leads to ambiguities 
in degenerate cases), we work with an explicit gradient vector field consistent with the 
function, coinciding with the usual discrete gradient vector field in the generic case. 
Our scheme always allows to construct a Morse function arbitrarily close to a given 
pseudo-Morse function and consistent with the given gradient vector field. 

A second symbolic perturbation scheme allows to relax the assumption that critical 
cells have unique function values. It extends the first perturbation scheme by explicitly 
maintaining a total order on the cells that is consistent with both the function and the 
gradient vector field. 

Multiple cancelations We establish a connection between persistence pairs and the 
cancelation of critical points by proving that for functions on surfaces, every persistence 
pair can eventually be canceled if a sequence of cancelations is performed according to 
a certain hierarchy on the persistence pairs (Section 3.3). The statement is no longer true 
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for manifolds in higher dimensions or non-manifold 2-complexes, where persistence 
pairs cannot always be canceled. 

Tightness of the stability bound While the stability bound can easily be seen to 
be tight when only a single pair of critical points is canceled, we need to ensure that 
repeated cancelation does not violate the (5-tolerance constraint. Again, the situation is 
different from previous work [12, 3], where simplification is treated separately for pairs 
of dimensions (0, 1) and (1,2). Our result requires to consider cancelation of pairs in 
different dimensions at the same time (Section 4.2). We provide a constructive proof of 
the tightness of the stability bound (Theorem 16). The construction is well suited for 
proving our theorem; however, it has a suboptimal quadratic time complexity. 

Efficient solution We show that, after persistence pairs have been computed in time 
<9(sort(n)) [10], an optimal solution to the topological simplification problem on surfaces 
can be computed in time 0(n) using simple graph traversal methods (Section 5). Hence, 
we match the time complexity of [3]. Since Theorem 16 is already established, we can 
use it to give a simple proof of correctness of the linear algorithm. 

This result is surprising in view of the fact that the topological simplification problem 
on surfaces is NP-hard when restricting to simplexwise linear functions on a triangu- 
lated surface. This follows from a recent result by Gray et al. [16], which states that 
minimizing the number of extrema of a simplexwise linear function with interval con- 
straints for the vertex values is NP-hard. Their argument can easily be adapted to our 
problem setting, where all constraint intervals are assumed to have length 26. Note 
that the emphasis on simplexwise linear (i.e., linear on each simplex, as opposed to 
just piecewise linear) functions is significant here: a multiple saddle can be split into 
several non-degenerate saddles by an arbitrarily small (in L M ) perturbation in the space 
of piecewise linear functions, but not in the subspace of simplexwise linear functions. 

This emphasizes the important role of discrete Morse theory in our problem: the 
hardness of the problem in the simplexwise linear setting arises from the possibility 
that the input contains multiple saddles, which is excluded by definition in discrete 
Morse theory. Going from simplexwise linear functions to discrete Morse functions 
(Section 2.3) can be interpreted as splitting multiple saddles. 

Energy minimization of simplified functions The solution to the topological sim- 
plification problem is not unique in general: both the 5-constraint and the simplified 
discrete gradient vector field impose a set of linear inequalities on the simplified func- 
tion, so the solution set is a convex polytope. This additionally allows to minimize a 
suitable convex energy functional. We employ this technique to remove artifacts from 
the initial solution and to improve the similarity to the input function (Section 6.4). 

1.2 Related work 

Topological simplification of functions within a ^-tolerance constraint has been consid- 
ered before by Edelsbrunner et al. [12] and Attali et al. [3]. The problem considered 
there differs from ours by a seemingly small but significant detail: in [12, 3] the critical 
points of the input function / that are not eliminated are additionally assumed to retain 
the same critical value for the output g. This restriction has serious consequences: while 
it allows for the elimination of critical points of / with persistence < 6, in certain cases 
not all critical points with persistence < 26 can be eliminated; an example is given 
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in [12]. Hence, under this restriction it is not possible to match the stability bound. 
Moreover, this result does not provide any information about the tightness of the stability 
bound since it considers only a restricted set of functions. 

The methods presented in [12, 3] can be interpreted as variants of the carving 
method proposed by Soille [31] in the context of terrain simplification. There is another 
popular method for removing extrema from terrains, called filling or flooding [19, 2, 9]. 
A combination of both methods has been proposed in [32]. Our methods of canceling 
critical points from a function can be interpreted as a combination of carving and 
flooding in the realm of discrete Morse theory. 

Apart from the above mentioned works, persistent homology provides the basis 
of several other elegant methods for computation and simplification of multi-scale 
structures derived from a function. For example, Edelsbrunner et al. [10] discuss simpli- 
fication of the persistent homology for flltmtions of simplicial complexes. Edelsbrunner 
et al. [11] and Gyulassy et al. [17] consider simplification of cell decompositions (Morse- 
Smale complexes) resulting from a given Morse function. Unfortunately, a simplified 
Morse-Smale complex does not directly give rise to a simplified function. Indeed, 
simplifying a Morse-Smale complex is closely related to simplifying a discrete gradient 
vector field. 

The problem of constructing discrete gradient vector fields (as opposed to functions) 
that minimize the number of critical points without constraints is addressed by Lewiner 
et al. [26] for surfaces and by Joswig and Pfetsch [20] for complexes of arbitrary 
dimension. King et al. [22] were the first to propose the combination of persistence 
with discrete Morse theory to simplify the gradient vector field of an input function on 
a 3-dimensional simplicial complex. Their method has quadratic time complexity and 
produces a simplified discrete gradient vector field but not a function. Moreover, it does 
not aim at optimality (in 3 dimensions, not every persistence pair can be canceled). 

Several statements of this article can also be transferred to the setting of piecewise 
linear Morse-Smale complexes. For example, Theorem 13 can be used to show that the 
successive simplification of a Morse-Smale complex on a surface proposed by Edels- 
brunner et al. [11] is always possible. This extends the Adjacency Lemma in [11], 
which shows a necessary but not sufficient condition for the successive cancelation of 
persistence pairs. 

2 Discrete Morse theory 

Classical (smooth) Morse theory [28] relates the critical points of a generic smooth 
real-valued function on a manifold to the global topology of that manifold. Forman 
[14, 15] carried over the main ideas of Morse theory to a combinatorial setting. We 
briefly review some important notions and results here that are used throughout this 
article, together with some extensions to Forman's theory that provide important tools 
for our solution. 

A CW complex 7C is a topological space constructed inductively: starting with 
a discrete set TCo of 0-cells, the n-skeleton 7C„ is formed by attaching /i-cells (open 
n-dimensional balls) by continuous maps S" _I — > 7C" _1 from their boundary to the 
(n - l)-skeleton. The set of cells of "7C is denoted by K. Throughout this article, we 
consider only finite CW complexes. Whenever a cell t e K is attached to a cell <x 
(i.e., cr c <9t, where 8t denotes the boundary of t), we call cr a face of t; a face of 
codimension 1 is called a facet. If all attaching maps are homeomorphisms, 'K is called 
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Figure 2: Reversing a gradient vector field along the unique path from dp to <p produces 
a gradient vector field in which the 1-cell <p and the 2-cell p are no longer critical. 

a regular CW complex. A regular CW complex whose underlying space is a 2-manifold 
is called a combinatorial surface. We refer to [27, 18] for details on CW complexes. 

2.1 Discrete vector fields 

One of the central concepts of discrete Morse theory is that of a discrete vector field - a 
purely combinatorial analogue of a classical vector field. 

Definition (discrete vector field, critical cell [14, 15]). A discrete vector field V on a 
regular CW complex 'K is a set of pairs of cells (<x, t) e K x K, with o~ a facet ofr, such 
that each cell of K is contained in at most one pair ofV. A cell o~ e K is critical with 
respect to V if a is not contained in any pair ofV. The dimension of a critical cell is 
also called its index. 

A pair (<x, r) in a discrete vector field V can be visualized as an arrow from a to t 
(as in Figure 2). 

In the following, we consider an important subclass of vector fields in which the 
arrows do not form closed paths. This can be made precise using the concept of V -paths. 

Definition (V-path [15]). Let V be a discrete vector field. A V-path Y from a cell o~q to a 
cell o~ r is a sequence (ctq, to, cr\, . . . , r r _i, cr r ) of cells such that for every < i < r — I: 

o~i is a facet of T; with (cr, ,r, ) € V and cr, + i is a facet of t,- with (07+1,1";) ( V. 

T is closed if 'cr = cr r and nontrivial if r > 0. We call dim cr the dimension of F. 

By a V-path from dp to cp we mean a V-path from a facet of p to <p (see Figure 2 for 
an example). 

Definition (discrete gradient vector field [15]). A discrete vector field V is a discrete 
gradient vector field if it contains no nontrivial closed V -paths. 

The main technique for reducing the number of critical points is that of reversing a 
gradient vector field V along a V-path between two critical cells p and (see Figure 2 
for an example). It provides a discrete analogue of Morse's cancelation theorem [30]: 

Theorem 1 (Forman [14]). Let cf> and p be two critical cells of a gradient vector field V 
with exactly one V-path T from dp to <p. Then there is a gradient vector field V obtained 
by reversing V along the path F. The critical cells ofV are exactly the critical cells ofV 
apart from {(f>,p}. Moreover, V — V except along the path F. 

Gradient vector fields on combinatorial surfaces have additional properties that do 
not hold in higher dimensions. The following property is readily checked using the fact 
that a 1-cell is only attached to at most two 0-cells, and at most two 2-cells are attached 
to a 1-cell: 
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Lemma 2. Two V -paths of dimension cannot branch at a common cell, and two 
V '-paths of dimension 1 cannot merge (except at their last cell). 

Corollary 3. Let p be a critical 1-cell of a discrete vector field V on a combinatorial 
surface. Then there are at most two V -paths from dp to critical O-cells, each starting 
at one of the two O-cells in dp. Similarly, there are at most two V -paths from facets of 
critical 2-cells to p. 

2.2 Pseudo-Morse functions and symbolic perturbation 

As in smooth Morse theory, a discrete gradient vector field can be understood as the 
gradient of some non-degenerate function in the following sense: 

Definition (discrete Morse function [14]). A function f : K — > R on the cells of a 
regular CW complex 7C is a discrete Morse function if there is a gradient vector field Vf 
such that whenever cr is a facet of t then 

(cr, t) iVf implies /(cr) < /(r) and (cr, f) € Vf implies /(cr) > /(t). 

Vf is called the gradient vector field of /. 

In contrast to simplexwise linear functions, which are determined by their function 
values at the vertices, discrete Morse functions take values on cells of any dimension. 

The gradient vector field of a discrete Morse function encodes only the sign of 
the difference between function values, not the difference itself. Therefore a discrete 
gradient vector field does not uniquely determine a discrete Morse function, but for 
every discrete Morse function / there is exactly one gradient vector field Vf. 

In order to be able to treat non-generic input functions, it is useful to consider a 
more general class of functions, which we call pseudo-Morse functions. Pseudo-Morse 
functions substitute the strict inequality in the definition of Morse functions by a weak 
one. 

Definition (pseudo-Morse function, consistency). A function f : K — > R on the cells 
of a regular CW complex 7C is a discrete pseudo-Morse function if there is a gradient 
vector field V such that whenever cr is a facet ofr then 

(cr, t) t V implies f(o~) < /(t) and (cr, t) e V implies /(cr) > /(t). 

In this case, we call f and V consistent. 

Note that a gradient vector field V consistent with a pseudo-Morse function / is not 
unique in general. The following lemma provides a useful characterization of discrete 
pseudo-Morse functions. 

Lemma 4. Let f : K — > R be a function on the cells of a regular CW complex 7C and let 
V be a gradient vector field on "K. Then f is a discrete pseudo-Morse function consistent 
with V if and only if for every e > there is a discrete Morse function / e : A' — > R with 
Wfe ~ /ll°o — e such that V is the gradient vector field of f £ . 

Proof. Assume that / is a pseudo-Morse function consistent with a gradient vector field 
V. There exists a discrete Morse function g whose gradient vector field V g is precisely 
given by V [14]. Let G be the maximum absolute value of g. Given e > 0, for each cell 
cr define / e (cr) := /(cr) + e^-. Then it is straightforward to check that f e is a discrete 
Morse function with gradient vector field V and ||/ e — /|U ^ e - 
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On the other hand, assume that for every e > there is a discrete Morse function 
f e : K — > R. consistent with V and ||/ e - /|| M < e. Choose e such that for every (p,p e K 
with f((p) + f(p) we have e < Wt^£)l i n m j s ca se, one easily verifies that / is a 
pseudo-Morse function consistent with V. a 

The previous lemma provides a symbolic perturbation scheme based on gradient 
vector fields in order to allow for non-generic (degenerate) input functions. Starting 
with a pseudo-Morse function /, we can choose a consistent gradient vector field V, 
which may not be unique. Lemma 4 asserts that there is a discrete Morse functions f e 
arbitrarily close to / and consistent with V. Therefore we can work with / as if it were a 
discrete Morse function with gradient vector field V. In particular, we use it to consider 
critical points associated to a pseudo-Morse function by choosing a consistent gradient 
vector field. 

This first symbolic perturbation scheme is not sufficient for all our purposes; the 
definition of persistence pairs in Section 3 not only requires a gradient vector field, but 
also a total order on the critical cells, which again might not be uniquely defined by 
a pseudo-Morse function / and a consistent gradient vector field V. We now derive a 
second perturbation scheme that meets these requirements. 

Since a gradient vector field imposes certain inequality constraints on the functions 
consistent with it, we can ask how these inequalities affect the relation between the 
function values of any two cells. We observe that any discrete gradient vector field gives 
rise to a strict partial order on the set of cells: 

Definition (induced partial order). Let V be a discrete gradient vector field and consider 
the relation <— y defined on K such that whenever o~ is a facet ofr then 

(cr, r) £ V implies cr <— y t and (cr, t) e V implies cr — >y t. 

Let <v be the transitive closure of <— y. Then <y is called the (strict) partial order 
induced by V. 

The interpretation of this partial order is that for any pseudo-Morse function / 
consistent with V and any two cells and p, the relation (p <y p implies f(<p) < f(p). 
The relation <— y is the covering relation of <y, i.e., <p <— y p implies <p <v p and there 
is no tfr with <p <y if/ <y p. The covering relation of a partial order forms a directed 
acyclic graph called the Hasse diagram (with edges oriented as suggested by <— y). The 
Hasse diagram Hy of <y is obtained from the Hasse diagram of the face lattice of 7C by 
inverting the orientation of all edges corresponding to pairs (cr, t) € V as described by 
Chari [5]. Hy has the property that (p <y p if and only if there is a directed path from p 
to <p. Note that cr <— y r implies /(cr) < /(t), i.e., both the arrow visualizing (cr, r) e V 
and the arrow symbolizing cr «— v r point towards a (weakly) decreasing function value 
of/. 

Assume we are given a pseudo-Morse function / consistent with a gradient vector 
field V. On the one hand we have the induced partial order <y. On the other hand the 
function / canonically induces a strict partial order <j given by <p <f p f(<p) < f(p). 
Since the two orders -< t and < v are compatible by assumption (there are no two cells 
(<p,p) with <p <y p and <f> >f p), we can merge them into a strict partial order < f V (the 
transitive closure of (<f U <y) c K x K). A linear extension of this order is now a strict 
total order -< consistent with both / and V. 

Definition (consistent total order). Let V be a discrete gradient vector field V consistent 
with a discrete pseudo-Morse function f. Then a strict total order < is called consistent 
with (/, V) if it is a linear extension of <f and <y. 
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This total order -< gives rise to a canonical function K — > N, which is a discrete 
Morse function and consistent with V. If we use this function as the function g in the 
proof of Lemma 4 to construct / e , then f € is an injective discrete Morse function with 
gradient vector field V and the total order induced by f e is -< again. We thus obtain a 
second symbolic perturbation scheme for situations where a total order on the cells is 
required. 

We make use of this concept in the following definition. A classical object of study 
in smooth Morse theory is the sublevel set {x e M : f(x) < t] of a function / : M — > R 
on a manifold M. In the discrete theory, the analogous object is the level subcomplex, 
and the equivalent construction using our second symbolic perturbation scheme is the 
order subcomplex: 

Definition (level subcomplex [14], order subcomplex). Let f be a pseudo-Morse func- 
tion on a regular CW complex 7C. Let the carrier of a subset L C K be the smallest 
subcomplex ofK containing all of L. Then for t e R, the level subcomplex is 



Similarly, let < be a strict total order on the cells K of a regular CW complex "K. Then 
for a cell cr e K, the order subcomplex is 



Like in the smooth theory, the homotopy type of level subcomplexes changes only 
at critical cells. The statement can trivially be rephrased for order subcomplexes: 

Theorem 5 (Forman [14]). Let V be a gradient vector field on 7C and let < be a linear 
extension of <\. If p and if/ are two cells such that p < if/ and there is no critical cell <p 
with respect to V such that p < <f> < if/, then 'Kiifi) collapses to 'K(p). 

The order subcomplexes provide a finer (cell-by-cell) filtration of the complex 7C 
than the level subcomplexes, in particular if / is degenerate. This turns out to be useful 
when working with persistent homology in Section 3. 

2.3 Piecewise linear functions and discrete Morse functions 

In this section we discuss a canonical relationship between discrete and piecewise linear 
(PL) Morse theory. As it turns out, it is possible to translate statements from one setting 
to the other seamlessly. Similar constructions have been used by King et al. [22], Attali 
etal. [3]. 

Assume that 7C is a simplicial complex. Let /pl be a simplexwise linear function on 
7C and let /o be its restriction to the 0-skeleton of K. The function /o inductively gives 
rise to a discrete pseudo-Morse function / in the following way. For each 0-cell a, let 
/(or) = fo{a). For a cell r with dim r > 0, let /(r) be the maximum value of / on any 
facet of t. The function / can easily be seen to be pseudo-Morse since it is consistent 
with the empty vector field V = (all cells are critical). Note that any level subcomplex 
of / coincides with the induced subcomplex of 7C on the corresponding sublevel set 
of /o. This induced subcomplex, in turn, is homotopy equivalent to the corresponding 
sublevel set of /pl [25, 29]. This means that from a Morse-theoretic point of view, the 
PL function /pl and the pseudo-Morse function / are equivalent. We conclude: 
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Theorem 6. Let fpi be a simplexwise linear function on a simplicial complex 7C. Then 
there is a canonical pseudo-Morse function f on'K such that for every t 6 R the sublevel 
set {x G 7C : fpL.ix) < t) is homotopy equivalent to the level subcomplex 'Kit). 

Vice versa, we can interpret any discrete pseudo-Morse function / on a regular CW 
complex as a simplexwise linear function / s( j : |sd7C| — > R on the underlying space of 
the barycentric subdivision sdTC. The bary centric subdivision of a regular CW complex 
7C is the order complex of the face lattice, i.e., the abstract simplicial complex sdTC 
whose vertices are the cells of 7C and whose simplices are the totally ordered subsets 
of the face lattice. The underlying space |sd *7C| is homeomorphic to K. The function 
/ s d is assumed to linearly interpolate the values of / at the vertices of |sd *7C| inside 
each simplex of |sd(7C)|. Again, the sublevel sets of / S( j are homotopy equivalent to the 
corresponding level subcomplexes of /: 



Figure 3: Illustration to Theorem 7, showing the homotopy equivalence of the level 
subcomplex 7C(f) to the sublevel set \x e |sd7C| : fs&ix) < t] for t - 5. From left to right: 
function / on 7C(5); barycentric subdivision sd7C(5) = A(K(5)) = A(7); vector fields 
defining the collapse of A(7) onto A(6) and of A(6) onto A(5); sublevel set of / s( j. 

Theorem 7. Let f be a pseudo-Morse function on a simplicial complex 'K. Then f 
induces a simplexwise linear function fd on |sd7C| such that for every t 6 R the level 
subcomplex "Kit) is homotopy equivalent to the sublevel set [x e |sd *7C| : f sc i(x) < t). 

Proof. Let V be a discrete gradient vector field on 7C that is consistent with / and let -< 
be a total order consistent with (/, V). Let K(t) and Kip) denote the cells of the level 
and order subcomplexes 7C(f) and "Kip), respectively. Let A(t/) denote the induced 
subcomplex of sd 'K on a vertex set U (we identify a cell p e K with the corresponding 
vertex {p} e sdTC). The induced subcomplex A(Zf(f)) is easily seen to be identical to 
sd7f(f). Let Fit) = {(f> e K : f(<f>) < t] c Kit). We now show that A(K(t)) collapses 
simplicially onto A(F(f))- See Figure 2.3 for an example. 

Let cr e Kit) \ Fit) and let cr_ denote its predecessor with respect to < We write A(p) 
for A({0 G K : <p < p}). We show that A(cr) collapses onto A(cr_). It follows from the 
definition of an order subcomplex that (cr, r) € V for a unique r € Fit). Consequently, 
for every simplex S e A(cr) with cr e S and t i S the simplex T — S U {t} is also 
contained in A(<x). Hence, these pairs (5, T) constitute a discrete gradient vector field W 
on A(cr) such that exactly the simplices containing cr (the vertex star of cr) are non- 
critical. This vector field W provides a simplicial collapse of A(<x) onto A(<x_) by 
applying Theorem 5 with an arbitrary linear extension of <y?. By repeatedly applying 
this argument, we obtain that A(/T(f)) collapses onto A(f (f))- This implies that the 
underlying spaces are homotopy equivalent. 

Finally, let / s( j be the simplexwise linear extension of / from the vertices of sd(7C) to 
the whole complex. Recall that the induced subcomplex A(,F(f)) is homotopy equivalent 
to the sublevel set {x € |sd7C| : f s dix) < t] [25, 29]. The claim now follows. □ 
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This equivalence allows us to translate back and forth between piecewise linear 
functions and pseudo-Morse functions, and to apply theorems of piecewise linear Morse 
theory to discrete Morse theory. 

In a similar fashion, a discrete pseudo-Morse function can be constructed from a 
function defined only on the 2-cells of a combinatorial surface by defining /(cr) as the 
minimum value of all cells that contain (rasa facet. This can be used to construct 
discrete pseudo-Morse functions from functions defined on cubical grids, such as 
pixel images, by interpreting each pixel as a 2-cell. The resulting level subcomplexes 
correspond to the cubical complexes extracted from images as described by Kaczynski 
et al. [21]. Vice versa, a pseudo-Morse function on a cubical complex can be interpreted 
as a function defined on a subdivided grid. This construction has been used in the 
examples in Section 6. 

Note that starting with a PL function and constructing a pseudo-Morse function 
consistent with the empty vector field means that initially all cells are considered critical, 
which is a point worth discussing. King et al. [22] propose to construct an initial discrete 
gradient vector field with critical cells corresponding to the critical vertices (in the PL 
sense, see [23, 13, 4]) of a (non-degenerate) input PL function instead. We omit such a 
step for two reasons. First, this step is unnecessary in our method and would not lead 
to different results. Second, the step can actually be interpreted as a special case of 
the topological simplification problem with 8 = 0. In this case, the problem reduces to 
minimizing the number of critical points among all gradient vector fields consistent with 
the input function. We discuss the simplification of a gradient vector field in Sections 3.3 
and 5.3. 

3 Persistent homology of discrete Morse functions 

The notions of persistent homology and persistence pairs were introduced in [10, 34, 6] 
in order to investigate the change of the homology groups in a filtration of a topological 
space (a nested sequence of subspaces). This concept can naturally be applied to discrete 
pseudo-Morse functions. The following definitions can be applied to cellular homology 
with coefficients in an arbitrary field F. We write H^CK) as a shorthand for the d th 
homology group H d (.9C; F) of % and HJK) = rf H d (9Q. 

Convention and Notation Throughout Section 3 we consider a pseudo-Morse func- 
tion / consistent with a gradient vector field V on a regular CW complex 7C and a strict 
total order < consistent with (/, V). 

3.1 Birth, death, and persistence pairs 

As a consequence of Theorem 5, the homology groups of order subcomplexes change 
only at critical cells of V. Let cr and t be critical cells such that cr < t and consider 
the inclusion map i' T,T : < K(cf) 'TC(t) between the order subcomplexes with regard 
to the total order <. This map induces a homomorphism i*' T : H t CK(cr)) — > //,(7C(t)) 
between homology groups. For every cell p, let p_ denote its predecessor with regard 
to <. Now consider the sequence 

H t {<K{crJ)) H t CK(cr)) H t (K(T-)) H*(<K(t)) 

of induced homomorphisms. Here we allow for the cases cr = t_ and <x_ = (if cr is 
the first cell in -<, in which case // H ,(7C(cr_)) is the trivial group). 
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Definition (birth, death, persistence pair [10]). We say that a class h e H*CK(cr)) is 
born at (or created by) a positive cell cr if 

hi im(if-'°"). 

Moreover, we say that a class h e H t CK(cr)) that is born at cr dies entering (or gets 
merged by) a negative cell r if there is a class h e H t CK(crJ)) such that 

C T -(h)<tim(i?- T -) but i? T (h) = i^' T Qi) e im(i?- T ). 

If there exists a class h that is born at cr and dies entering t, then (cr, t) is a persistence 
pair. The difference f(j) - /(cr) is called the persistence of (a, t). 

Note that in this definition we always have dim t — dim cr + 1 . On combinatorial 
surfaces, the only possible cases for (dim cr, dim t) are (0, 1) or (1,2). 

3.2 Duality and persistence 

For any closed combinatorial surface K, there is an associated dual complex ft*, a 
combinatorial surface homeomorphic to 7C whose /-cells correspond to (2 - /)-cells of 
7C [18]. A discrete pseudo-Morse function / on 7C gives rise to a discrete pseudo-Morse 
function f* on K* via cr* i-> -f(cr) [14]. 

Moreover, as shown by Cohen-Steiner et al. [7] and Attali et al. [3], the persistence 
pairs of dimension (1,2) for 7C correspond to the persistence pairs of dimension (0, 1) 
for the dual complex ft* (with t* -< cr* o cr -< f). The homology groups ffoC^O 9 ;)) 
(generated by the connected components of 'K(pi)), and hence the persistence pairs of 
dimension (0, 1), are determined solely by the 1 -skeleton of ft, also called the (primal) 
graph of ft. Consequently, the persistence pairs of dimension (1,2) are determined by 
the 1 -skeleton of ft*, called the dual graph. This means that all persistence pairs of a 
surface can be determined in terms of Morse functions on graphs. 

In order to treat surfaces with boundary, we employ the usual construction of 
attaching an additional 2-cell (with function value oo) to each boundary component. 
This way we obtain a closed surface having the same sequence of order subcomplexes 
(up to the additional cells) and hence the same persistence pairs as the original surface. 

3.3 The persistence hierarchy and sequential cancelations 

Persistence pairs on surfaces carry a certain hierarchical structure that allows us to 
establish a connection to the cancelation theorem of discrete Morse theory. The main 
result of this section is that persistence pairs on surfaces can always be canceled 
sequentially if the order of cancelations respects this hierarchy. 

Definition (parent, child, persistence hierarchy). On a combinatorial surface ft, let 
(cr, t) be a persistence pair with dim cr = 0, and let [cr] e Ho(ft(cr)) be the class created 
by cr. Let & be the unique cell creating the class [cr] e HqCK(t)) into which [cr] gets 
merged by t, i.e., [&] £ im(i*~' T ) and [cr] — i*' T ([o~]). Then & is called the parent ofcr 
(in the persistence hierarchy), and cr is called the child o/cr. The transitive closure of 
the child relation is called descendant. 

Let (cr, t) and (cr, f) be two persistence pairs. If either dim cr = dim & — and & 
is the parent of cr or dim r = dim f = 2 and f* is the parent of t* (with regard to the 
persistence hierarchy on the dual complex), then we also call the pair (cr, f) the parent 
of (cr, t) and (cr, t) the child of (&, f ). The following definition and lemma justify this 
nomenclature: 
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a 

Figure 4: The persistence hierarchy. Both (cr, r) and (&, f) are children of, and hence 
nested in, (cr, f). Only (cr, t) needs to be canceled before (&, f) can be canceled. 

Definition (nested pairs). On a combinatorial surface 7C, let (cr, r) and (cr, f ) fee two 
persistence pairs. We say that (cr, t) i's nested in (cr, f ) cr -< cr -< r < f. 

Lemma 8. Let (cr, t) be a descendant of(&, f) in the persistence hierarchy. Then (cr, r) 
/s nested in (cr, f). 

Proof. Without loss of generality, assume dim cr — 0; otherwise, by duality, the argu- 
ment can be applied to (t*, cr*) instead of (cr, t). 

By definition of the persistence hierarchy, [cr] gets merged into the class [cr] e 
HoCK(t)) created by cr. This implies that cr -< cr. It also implies that the class created 
by cr has not been merged by any cell of 7C(r), hence t < f. a 

We now turn our attention to the sequential cancelation of persistence pairs. Note 
that the cancelation theorem (Theorem 1) applies to vector fields, which only provide a 
partial order on the cells, while the notion of persistence is based on a total order. After 
canceling a persistence pair, the new vector field is no longer consistent with the initial 
total order. It is important to keep in mind that we only talk about persistence pairs 
of the initial total order -<, which is consistent with (/, V); we do not consider a new 
total order after applying a cancelation (which would complicate things considerably). 
Applying several cancelations results in a sequence of simplified vector fields: 

Definition (persistence cancelation sequence). A persistence cancelation sequence is 
a sequence of gradient vector fields (Vq, V\, . . . , V„) with Vq — V, where each V, is 
constructed from Vj_i by canceling a persistence pair (cr,, r,) using Theorem 1. 

A persistence cancelation sequence is called nested if in this construction every pair 
(cr,-, Tf) nested in another pair (crj, tj) is canceled first, i.e., cr j < cr-, < T; -< Tj => i < j. 

A persistence cancelation sequence is called a ^-persistence cancelation sequence ;/ 
exactly those persistence pairs are canceled that have persistence < 6. 

A persistence pair (cr, t) can be canceled from a vector field as soon as all descen- 
dants have been canceled (compare also to Edelsbrunner et al. [11] for the existence 
part of the following statement in a special case): 

Lemma 9. On a combinatorial surface "K, let (Vq, V\, . . . , V,) be a persistence cance- 
lation sequence. Assume that a persistence pair (cr, t) has not been canceled in the 
sequence but that every descendant of ( cr, t) has been canceled. Then there exists a 
Vrpathfrom <9t to cr and this path is unique. 

Assume further that every persistence pair nested in ( cr, t) has been canceled. If 
there is a unique Vj-path from <9t to another cell cr + cr that is critical in V, then we 
have cr > cr. 
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Figure 4 shows that the condition is sufficient but not necessary. The proof of 
Lemma 9 relies on a few auxiliary lemmas and is given after Lemma 12. 

Lemma 10. Let (Vq, Vi, . . ., V,) be a persistence cancelation sequence and let (cr, r) be 
a persistence pair with dim cr — that has not been canceled in the sequence. Let C be 
the connected component of the subcomplex 7C(r_) containing cr, and let C denote the 
cells ofC. Then every ((f>,p) € V; with dimcp — satisfies <p e C o p e C. 

Proof. The claim is shown by induction. The base case follows from consistency of 
the total order -< with (/, V). Consider the cancelation of a persistence pair (cr,-, t,). 
If dim cr, 4l 0, the tuples in V, of dimensions (0, 1) stay unchanged and the claim 
immediately follows from the induction hypothesis. Now assume dim cr, = 0. We show 
that the claim holds for every (<p,p) e V, \ Vj-i- 

The cells in V, \ Vj-i are r, and the cells on the V,_i-path (<po,po, <f>\,... ,p r -i,(f> r ) 
from <pQ e <3t, to <p r = cr,. By the induction hypothesis we have ^ e C o pk e C. 
Because C is a subcomplex, we also have pk-\ e C => <pk e C (with p_i = t,). Moreover, 
if cr, e C then cr,- is a descendant of cr and by Lemma 8 (cr,, r,) is nested in (cr, t), 
implying that cr,- and t,- are in the same connected component of 7C(t_). Hence we also 
have <x, e C => t,- e C. Consequently, either all or none of the cells in V, \ Vj_j are 
contained in C and the claim immediately follows. □ 

We also require the notion of the restriction of a vector field to a subcomplex: 

Definition (restriction of a vector field to a subcomplex). Let V be a discrete vector 
field on 'K and let "K be a subcomplex ofK with cells K. The restriction ofVto'K is 
V — V Pi (K X K), i.e., the pairs of cells in V that are both in K. 

As a direct consequence of this definition, the restriction of a vector field V onto a 
subcomplex may have critical cells that are not critical in V: 

Lemma 11. Let V be the restriction of a discrete vector field V on'K to a subcomplex/K. 
The critical d-cells ofV are exactly the critical d-cells ofV that are contained in 7C // 
and only if each pair (cr, t) e V with dim cr = d satisfies cr e K <=> t g K. 

Moreover, we use the following fact: 

Lemma 12. Let V be a discrete gradient vector field V on'K with only one critical 
0-cell cr. Then there is a V '-path from every 0-cell & to cr. 

Proof. Each V-path of dimension ending at a non-critical cell & + cr, (cr, f) e V, can 
be extended by f and the unique 0-cell & e df, & + &. Since K is finite and V does not 
contain nontrivial closed paths, the extension will eventually end up at cr. a 

Proof of Lemma 9. Without loss of generality, assume dim cr = 0; otherwise, by duality, 
the argument can be applied to (r*,cr*) instead of (cr, r). 

Let C be the connected component of the subcomplex 7C(t_) created by cr. Apart 
from cr, every 0-cell in C that is critical in V is a descendant of cr. By assumption, all 
descendants of cr have been canceled, and hence cr is the only 0-cell in C that is critical 
in V/. By Lemmas 10 and 11, cr is also the only critical 0-cell in the restriction of V, 
to C. By Lemma 12, there is a V,-path to cr from every 0-cell in C, in particular from 
exactly one of the two 0-cells in <9t since <9t n C contains exactly one cell. By Lemma 2, 
this path is unique. 

Now assume that every persistence pair nested in (cr, t) has been canceled and there 
is a unique V,-path from dr to another cell & + cr that is critical in V,-. By assumption, & 
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is not a descendant of cr, meaning that cr and cr are in different connected components 
of 7C(r_). Moreover, cr creates the component C + C, because otherwise we would have 
an uncanceled pair (cr, f) nested in (cr, r). Since r is paired with cr and merges C and C, 
we know that cr is a descendant of cr and cr > cr. □ 

As a consequence of Lemma 9, we can construct a sequence of cancelations to 
eliminate all persistence pairs below a certain persistence threshold: 

Theorem 13. Let f be a pseudo-Morse function on a combinatorial surface 'K and 
let 5 > 0. Then there exists a nested 5-persistence cancelation sequence. 

Proof. If the subsequence (Vo, V\, . . . , V;_i) satisfies the assumptions of Lemma 9 for 
some persistence pair (cr,, r,), we can use Theorem 1 to construct V, from V,-\. A 
canonical choice satisfying these assumptions is given by canceling the persistence pairs 
(cr ; ,T,) with persistence < 5 according to the order -< on the negative cells, i.e., t,- -< r !+ i 
for every i. The claim follows by induction. □ 



3.4 The stability bound 

Cohen-Steiner et al. [6] studied properties of persistence diagrams, which are a repre- 
sentation of the value pairs (/(c), /(t)) corresponding to the persistence pairs (cr, t) of 
a function /. Here we use R = RU {— oo, oo}. 

—2 

Definition (Persistence diagram [6]). The persistence diagram D(f) c R of a pseudo- 
Morse function f is the multiset consisting of(f(cr), f(r))for all persistence pairs (cr, r) 
of f, together with all points on the diagonal counted with (countably) infinite multiplic- 
ity. An unpaired positive cell cr is represented by (/(cr), oo). 

The main result of [6] is the Bottleneck Stability Theorem for persistence diagrams: 
if two functions are close then their persistence diagrams are also close. Due to the 
correspondence between piecewise linear functions and discrete pseudo-Morse functions 
(Section 2.3), the statement reads as follows in the language of discrete Morse theory: 

—2 

Definition (Bottleneck distance). Let X and Y be two multisets of R . The bottleneck 
distance is dg(X, Y) :— inf y sup^^ \\x — y(x)|| M , where y ranges over all bijections from 
XtoY. 

Here we assume (a, oo) - (b, oo) = (a - b, 0), (a, oo) - (b, c) — (a - b, oo), and 
\\{a, oo)!!^ = oo for a,b,c€ R. 

Theorem 14 (Cohen-Steiner et al. [6]). Let f,g:K—>Rbe two discrete pseudo-Morse 
functions. Then the respective persistence diagrams satisfy d B (D(f),D(g)) < ||/ - gW^. 

Note that the bottleneck distance provides a metric on the persistence diagrams 
of pseudo-Morse functions on 7C, in particular, dB(D(f),D(g)) = if and only if 
D(f) = D(g). Therefore, in contrast to the persistence pairs, the persistence diagram of 
a discrete pseudo-Morse function / is well-defined; in particular, it is independent of 
the total order < chosen and even independent of the gradient vector field V consistent 
with /. Theorem 14 provides a lower bound on the number of persistence pairs among 
all pseudo-Morse functions fg with \\fg - /|| M < 6: 

Corollary 15 (Stability Bound). For any pseudo-Morse function f s with \\f s - fW^ < 6, 
the number of persistence pairs of fs is bounded from below by the number of persistence 
pairs of f that have persistence > 26. 
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Proof. Let D and D$ be the persistence diagrams of / and fg, respectively. By Theo- 
rem 14 we have c/g(D, Dg) < 5. This means that there is a bijection y between D and 
D s with \\p - y{p)\\oc < 5 for all p e D. Let p = {p*,p^) = (/(c), /(t)) g D represent a 
persistence pair (<x, r) of / with persistence //'" - p* > 25. Letting q - (q*, q^) :- y(p), 
this implies that p* + 5 > q* and p< - 5 < q^. Together with - p* > 23, this 
yields </' - q* > 0. Hence there must be a persistence pair of fg corresponding to each 
persistence pair of / with persistence > 25. □ 



4 Function simplification guided by discrete gradient 
vector fields 

We are interested in functions that achieve the lower bound of Corollary 15: 

Definition (Perfect (^-simplification). Let f be a pseudo-Morse function on a combina- 
torial surface "K. A perfect ^-simplification of f is a pseudo-Morse function f s such 
that \\fg — /| loo < 5 and the number of persistence pairs of f s equals the number of 
persistence pairs of f that have persistence > 25. 

In this section, we prove the following central result: 

Theorem 16. Let f be a discrete pseudo-Morse function on a combinatorial surface. 
Then there exists a perfect ^-simplification of f. 

The proof of Theorem 16 is constructive and hence leads to an algorithm. The 
corresponding construction is outlined in Section 4.1. Unfortunately, the resulting 
algorithm has a running time that is quadratic in the input size. We present an efficient 
algorithm in Section 5. The proof of its correctness becomes easier once Theorem 16 is 
established. This is the reason why we present two separate constructions. 

Corollary 17 (Tightness of the stability bound). Given a discrete pseudo-Morse function 
f on a surface and 5 > 0, there exists a discrete pseudo-Morse fs consistent with a 
gradient vector field Vg such that \\fg — fW^ < 5 and the number of critical points ofVg 
equals the number of critical points off that have persistence > 25. 

Using Lemma 4, the result can also be stated for (non-degenerate) discrete Morse 
functions (in a slightly different form, because only critical points with persistence < 25 
can be eliminated within a tolerance of 5 in the set of discrete Morse functions): 

Corollary 18. Given a discrete Morse function f on a surface and 5 > 0, there exists a 
discrete Morse function fg such that \\fg — f\\oc < 5 and the number of critical points of 
fs equals the number of critical points of f that have persistence > 25. 

Convention and Notation Throughout this section we consider a given pseudo-Morse 
function / consistent with a gradient vector field V on a combinatorial surface 'K, a strict 
total order -< consistent with (/, V), and a nested 2<5-persistence cancelation sequence 
(Vo, . . . , V„) with Vo = V. Moreover, we let <j :- <y denote the partial order induced 
by Vj. 
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4.1 The plateau function 



For every V,- in the cancelation sequence, we inductively define a pseudo-Morse function 
fi consistent with V,, see Figure 1 for an illustration. By assumption we start with a 
pseudo-Morse function /o := / consistent with Vo := V. Suppose that we have con- 
structed a pseudo-Morse function consistent with Vi-\. Let (cr, t) be the persistence 
pair that is canceled in the construction of V, from V,-_i using Theorem 1. We define the 
corresponding plateau function f t as follows: 

if p >i-\ cr and fi-iip) < nij 
or p < H t and f-i(p) > m it 

fi-i(p) otherwise. 

This means that the attracting set \p : p >,_i cr} of cr is raised to at least the value m,-, 
and analogously the repelling set \p : p t) of t is lowered. Hence, ft creates a local 
plateau at the value ;«,-. The following lemma is a direct consequence of the way we 
construct f, from /;_] and the fact that /■ is constant along the path from 8t to cr. It can 
be proven using a straightforward induction argument. 

Lemma 19. The plateau function f is consistent with both Vi-\ and V;. 

Note that the construction of the plateau function does not depend on the properties 
of combinatorial surfaces but can be applied to regular CW complexes of arbitrary 
dimensions. Moreover, it does not depend on the cancelation persistence pairs: whenever 
we have a pseudo-Morse function / consistent with a gradient vector field V and V is 
constructed from V by a cancelation using Theorem 1, we can obtain a plateau function 
/ that is consistent with both V and V. 



mi = 



f(cr)+f(T) 



and fi(p):= 



4.2 Checking the constraint 

It remains to show that the plateau construction above is admissible, i.e., that all of the 
functions f satisfy the ^-constraint. 

Lemma 20. Each plateau function f satisfies \\f — /!!«, < 6. 

Proof. We show the statement by induction. The base case is trivial since /o = /. 

Let (cr, r) be the persistence pair that is canceled when constructing V,- from Vj_i. 
We show that the <S-constraint is neither violated by increasing the value of any cell p in 
the attracting set of cr in Vi-u nor by decreasing the value of any cell in the repelling 
set of t. Since f{p) = f-iip) for all cells p not treated in these two cases, the claim 
follows. 

We first show \f(p) - f(p)\ < 6 for any cell p >,_! cr with f-i(p) < nij. By the 
induction hypothesis we have a lower bound f-\{p) > f{p) - 5. By construction of f, 
the value of p is increased: f{p) = nij > f-i(p). Therefore, the lower bound remains 
valid after step i: 

fi(p)>fi-i(p)>f(p)-6. 
To show the upper bound f(p) < f(p) + 5, we first use f(r) - f(cr) < 26 to obtain 

ff , /(cr) + /(r) /(cr) + (/(cr) + 26) 

fi(p) = nit = < = /(cr) + 6. 

This is almost the desired inequality except that the right hand side contains /(cr) instead 
of /(p). To finish the proof, it therefore suffices to show that /(cr) < f(p). This, in turn, 
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Figure 5: Example illustrating Lemma 2 1 . Left: gradient vector field W (before reversing 
the path from dv to p). Right: gradient vector field W (after path reversal). Note that we 
have the new relation (p <^ ft (corresponding in this example to a W-path from ft to ft). 
In the example, the conclusion ft < w v and p <w ft of Lemma 21 is reflected by the two 
W-paths from dv to ft and from ft to p, respectively. 

is a consequence of the facts that, according to Lemma 22, cr -<,•_] p implies cr < p, and 
that -< is consistent with (J, V). 

It remains to show that \fi(p) - /(p)| < 6 for any cell p <,-_[ t with /j_i(p) > m,. The 
proof of this statement is analogous to the above. □ 

Before proving Lemma 22, we first investigate how the reversal of a gradient vector 
field may change the induced partial order (see Figure 5 for an example): 

Lemma 21. Let p, v, ft, ft be ( not necessarily disjont) cells of a regular CW complex 'K, 
and let W and W be two gradient vector fields. Assume that the cells p, v are critical 
in W and that W is constructed by reversing W along the unique W -path from dv to p. 
Assume further that ft -fc w ft and ft <^ ft. Then ft < w v and p <w ft. 

Proof. By definition of the induced partial order, ft <s ft implies that there exists a 
sequence (p\, . . . ,pk) with p\ — ft, pu — ft and p, <— ™ p, + i for all 1 < i < k - 1. Here 
either p,- is a facet of p, + ; or p, + i is a facet of p,-, and we therefore also have either 
Pi <~w P/+1 or pi — >w Pi+i- But since ft -Aw ft, there exists a smallest index j such 
that pj — *y? pj+\. Since the relations <— w and <— ^ differ only along the W-path from 
dv to p (including v), it follows that the cells pj and p /+ i are contained in this W-path. 
Hence we have pj < w v. Moreover, by the choice of j we have ft - p\ <w Pj- Therefore 
we conclude that ft <w v. By an analogous argument one also shows that p <w ft- □ 

Lemma 22. Let (Vq, . . . , V„) be a nested persistence cancelation sequence and let (cr, t) 
be a persistence pair of < with cr and T critical cells ofVi. Then for any p e K, 

(a) p >i cr implies p > a, and 

(b) p <i t implies p < t. 

Proof. We only present the proof of part (a), which is done again by induction: we show 
that p >i cr implies p > cr for all < i < n. Part (b) can be shown analogously. 

The base case i — is trivial since > is a linear extension of >q. Assume that 
p >i cr. If p >,_i cr, then the claim follows directly from the induction hypothesis. 
Hence we assume that p cr. Let (cr, f) be the persistence pair that is canceled 
when constructing Vt from Vj-i! this implies cr -<;_i f. From Lemma 21 with (W, W) - 
(Vj-\, Vj) and (p, v, ft, ft) = (cr, f , cr,p), we infer that cr f and & <i-\ p. This has 
two consequences: 

(i) cr -<;_] f (since cr is critical in Vt while f is not), and 

(ii) a < p (by the induction hypothesis). 
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To finish the proof of the claim, by (ii) it suffices to show that cr < <x. We proceed 
by case analysis on the dimensions of cr and cr. Since these two cells are positive by 
assumption, they have dimension less than 2. 

Case 1 (dim cr = 1, dim & = 0): This case cannot occur since reversing the V}_i -path 
from the 1-cell f to the 0-cell & does not change the attracting set of any critical 1-cell 
(and in particular cr), contradicting p cr and p >, cr. 

Case 2 (dim cr = 0, dimcr = 1): First assume r -< f. If additionally & < cr, this 
contradicts the assumption that the cancelation sequence is nested and (cr, t) is canceled 
after (cr, f ). Therefore t < f implies cr <&. 

Now assume r > f. This means that cr creates a connected component that is not 
yet merged in < K(f). Since cr < i _ 1 f by (i), there is a sequence (pi, . . . ,pd) with pi = cr, 

= f, and pj <— y._, p ;+ i for all 1 < j <k—l. For each p ; we trivially have p ; f 
and hence pj < f by the induction hypothesis, implying that pj e 7C(f). Moreover, since 
either pj is a facet of pj + \ orp i+ i is a facet of p^, we know that all pj, and in particular 
cr and f, are in the same connected component of 7C(f). In an analogous way one shows 
that cr and f , and hence cr and &, are in one and the same connected component. Since 
we know that cr created that component, it follows that cr < &. 

Case 3 (dimcr = dimcr e {0, 1}): The relation cr f from (i) above implies the 
existence of a V,_i-path from f to cr. We will show by contradiction that this path must 
be unique. To see this, assume that there are two Vj_i -paths from f to cr. Without loss of 
generality, assume that dimcr = dimcr = (and hence dimf =1); otherwise, by duality 
the following argument can be applied to cr* , f* instead of f, cr. By Corollary 3, each of 
the 0-cells in df must belong to exactly one of the two V,_i -paths from f to cr. Now by 
a similar argument as in Case 2 above, we obtain that each cell of these two Vj_i -paths 
is contained in the same connected component of "7C(f_) as cr. But since f is a negative 
1-cell, the two 0-cells in its boundary belong to different connected components of 
7C(f_), a contradiction. 

Hence, there is a unique V,-_i-path from f to cr. Lemma 9 asserts that & is the largest 
cell (with regard to <) with a unique V,_i-path from f to &. Since cr + &, we obtain 
cr < cr. a 

Proof of Theorem 16. According to Theorem 13 there exists a nested 2<5-persistence 
cancelation sequence (Vq, V\ , . . . , V„) for the pseudo-Morse function /. Let f n be the 
plateau function corresponding to V„. Since /„ is consistent with V„ by Lemma 19 and 
Wfn - f\\oo < 5 by Lemma 20, it is a perfect (5-simplification. □ 



5 An efficient algorithm 

The definition of the plateau function in the previous section canonically leads to an 
algorithm that runs in time quadratic in the input size. In this section we present a 
method for computing a perfect ^-simplification in time dominated by the computation 
of persistence pairs, i.e., (9(sort(n)), where n = \K\ is the number of cells of K. Apart 
from this computation, all steps of our algorithm take linear time 0{n). We stress that 
pre- and post-processing steps, like conversion from and to PL functions, also require 
only linear time 0(n). 

The algorithm can be summarized as follows. First, persistence pairs are computed 
using a variant of Kruskal's algorithm for minimum spanning trees. Next, the persistence 
pairs are used to construct a simplified gradient vector field by a graph traversal of both 
the primal and dual 1 -skeleton. In a third step, the simplified vector is used to compute 
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the simplified function by a graph traversal on the Hasse diagram of the partial order 
induced by the simplified vector field. 

5.1 Defining a consistent total order 

Assume we are given a pseudo-Morse function / consistent with a discrete gradient 
vector field V as input. We write (f> ^ v p if neither <p < v p nor (p > v p, and similarly 
for =! f. Let <t be an arbitrary total order on K. We define the order -< as the lexicographic 
order given by <f, <y, and we have <p < p if and only if either 

(a) <p< f p, 

(b) <p —f p and <p < v p, or 

(c) (f> -/ p and <p -v p and <p < T p. 

Now assume that / is constructed from data given as a PL or piecewise constant function 
as explained in Section 2.3. Then V is the empty vector field (all cells are critical), 
meaning that <p <v P if an d on ly if is a face of r. If now the order <j is chosen such 
that the cells are sorted by dimension, then <p <v P implies <p <j p. The definition now 
simplifies to: < p if and only if either 

(a) <p<fp or 

(b) </> -/ p and <p < T p. 

5.2 Computing persistence pairs 

Recall that the persistence pairs of dimension (0, 1) are determined solely by the 1- 
skeleton G of 7C. Therefore, persistence pairs can be computed by applying a variant of 
Kruskal's algorithm [24] for finding a minimum spanning tree to both the primal and the 
dual 1-skeleton [10, 3]. Let G be the 1-skeleton of K and M(G) the minimum spanning 
tree of G (using the total order -< for determining the edge weights, which implies 
uniqueness of M(G)). Kruskal's algorithm for computing M(G) initializes a graph T 
with the vertices of G, sweeps over the edges of G in order <, adds to T every edge of G 
that does not create a 1 -cycle, and returns the final graph T, Note that the set of edges of 
M(G) consists of all negative 1 -cells together with all 1-cells t with (<x, f) e V for some 
cr; all other 1-cells create a cycle in T. When encountering a negative 1-cell, we compute 
the persistence of the corresponding (0, 1) pair by storing for each connected component 
of the intermediate graph T the 0-cell that created it. Clearly we obtain all dimension 
(0, 1) persistence pairs this way. Simultaneously, we construct the subgraph Mg(G) of 
M(G) not containing the negative 1 -cells with persistence > 26. In an analogous way, 
for the dual 1-skeleton G* we can compute the minimum spanning tree M(G*) and 
obtain the subgraph M S (G*) together with all (1,2) persistence pairs. 

Kruskal's algorithm has a time complexity of 0(sort(n)), yielding a complexity 
of 0{n\ogri) for comparison-based sorting. Assuming that the function values are 
represented by a small (<9(log «)) word size, Attali et al. [3] point out that persistence 
pairs on a graph can be computed in linear time 0(n) on a RAM using radix sort together 
with a linear-time algorithm for minimum spanning trees. 

5.3 Extracting the gradient vector field 

We now explain how to construct a simplified gradient vector field Vs. To this end, 
we traverse (using depth-first search) each of the connected components of the primal 
graph Mg(G) (constructed in the previous section) from the 0-cell that created the 
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component. During this traversal, whenever we encounter an edge (1-cell) xfi that 
connects a previously visited vertex (0-cell) p with an unvisited vertex <f>, we add ((f), if/) 
to the gradient vector field V«y. This construction takes O(ri) time. 

We perform an analogous traversal for the dual graph Mg(G*). Again, whenever we 
encounter an edge if/* that connects a visited vertex p* with an unvisited vertex (f>* (with 
iff a 1-cell and p, <p 2-cells of the original complex), we add (iff, <f>) to the gradient vector 
field Vg. Note that the final Vg results from both the primal and dual traversals and is a 
vector field on 7C. 

Theorem 23. The gradient vector field Vg is identical to the final vector field V„ of a 
25-persistence cancelation sequence (Vo, . . . , V„). 

Proof. First observe that if (cr, t) e V„ and dim cr — 0, then both cr and t are ceUs 
of Mg(G) since all non-critical cells of V„ either are non-critical in V as well or have 
persistence < 26 (with respect to / and <). Moreover, the 0-cells creating a connected 
component of Mg(G) are the only critical 0-cells of V n (by definition) and of Vg (by 
construction). Since Mg{G) is a tree, the pairs (cr, r) € V„ with dim cr -Q are uniquely 
defined by this property. By applying the dual argument to MgiG*), the statement 
follows. □ 

5.4 Constructing the simplified function 

Finally, we construct a function fs (different from the plateau function defined in 
Section 4.1) that is consistent with the simplified gradient vector field Vg. Consider the 
Hasse diagram H := Hy 6 of the strict partial order <y ( as described in Section 2.2. We 
visit the vertices K of H in a linear extension of <y s . The problem of finding a linear 
extension of a partial order is also called topological sorting and can be solved using 
depth-first search on H [8]. At each visited cell cr, we define fg{<f) as the minimum 
value that satisfies the lower bound fg(cr) > /(cr) - 6 and renders fg consistent with Vg, 
i.e., 



The construction of fg also takes O(n) time. 

5.5 Correctness of the algorithm 

Theorem 24. The function fg constructed using the above algorithm is a perfect 6- 
simplification off. 

Proof. By construction fg is consistent with Vg. At the same time, by Theorem 23, 
Vg is the final vector field of a 2<S-persistence cancelation sequence. Therefore, by 
the definition of a perfect (^-simplification, it only remains to show that the constraint 
Wfs — /ll<w < S is satisfied. The lower bound fg > f - Sis satisfied by construction. It 
remains to show the upper bound fg < f + 6. 

Observe that the set of all perfect (5-simplifications consistent with Vg is defined by 
a set of linear inequalities: the upper and lower bounds on the function values given 
by / ± 6, and the inequalities that define consistency with Vg. Therefore, the set of 
(^-simplifications is a convex polyhedron P c W with n = \K\. The polyhedron P is 
bounded since it is a subset of the product of intervals Y\a-eK[f( (T ) ~ 6, f(cr) + 6]. From 
Theorem 16, we know that P is not empty. We now show that fg is contained in P. 
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Figure 6: Visualization of simplification artifacts. Function values indicated by gray 
levels. Left: Original function. Middle: Function obtained by the algorithm of Section 5. 
Note the bright path joining the two spots. Right: Function obtained after constraint 
energy minimization according to Section 6.4. While the simplified topological structure 
is maintained, the visual appearance is closer to the original function. 

First, consider the (unbounded) convex polyhedron P defined by the lower bound 
fg>f — 5 and the inequalities induced by Vg. By construction, fg is contained in P. 
Moreover, again by construction, fg minimizes the function value of any cell among 
all functions in P. In other words, for any function / in P c P, we have / > fg. This 
implies the upper bound fg < f + 6. □ 

6 Discussion 

6.1 Computational results 

We implemented the algorithm of Section 5 in C++. For a complex with over 4 million 
cells (the cubical complex for a 1025 x 1025 pixel image), we obtained a running time 
of about 15 seconds for computing a perfect ^-simplification on a 2.4GHz Intel Core 2 
Duo laptop. 

6.2 Symmetrizing the algorithm 

The method described in Section 5 assigns to each cell the smallest possible value. As 
a consequence, the output function differs from the input function / even if the input 
function is already a perfect ^-simplification. Moreover, the method is not symmetric 
in the sense that we obtain an output function which maximizes the values if we apply 
the algorithm to the function -/ on the dual complex and return the negative of the 
simplified function. Since both the minimal and maximal solutions are points of a convex 
polyhedron as explained in Section 5.5, we can take the component-wise arithmetic 
mean to obtain another perfect (5-simplification. 

With this modification, if the input function / is already a perfect ^-simplification, 
then the minimal solution is given by / - 6, while the maximal solution equals / + 5, so 
the arithmetic mean of both solutions returns / again as desired. 
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Figure 7: Top: Topographic map of elevation data set "Puget Sound" [1], showing 
the region around Tacoma. Contour lines shown every 500 meters. Elevation data is 
converted from a 512 x 512 grid into a pseudo-Morse function on 1050625 cells. 33120 
critical cells have persistence > (persistence diagram shown on the right). Bottom: 
Simplified elevation function obtained after constraint energy minimization according 
to Section 6.4 with 6 = 500 meters. The function has 1 minimum, 3 saddles, and 3 
maxima. 
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6.3 Flooding and carving artifacts 

Since the methods presented in the present article can be seen as combinations of the 
carving and flooding approaches, they also inherit some characteristics that may not 
always be desirable in practical applications (see Figure 6). 

Carving methods [31, 12, 3]) cancel a pair of critical cells by changing only the 
repelling or attracting set of the 1-cell (saddle). This results in a noticeable thin path 
being carved in the function. On the other hand, modifying extrema, i.e., lowering 
maxima and raising minima, produces regions with constant function value; this is 
called filling or flooding [19, 9]). Although this effect is less disturbing, it might appear 
unnatural in certain applications. In the next section, we propose a way to remedy both 
kinds of artifacts. 

6.4 Combining topological simplification and energy methods 

As mentioned in Section 5.5, the set of perfect (^-simplifications consistent with the 
simplified gradient vector field Vg is a convex polyhedron P. Hence, the presented 
method can be combined with energy minimization methods, since the polyhedron P 
can be used as the feasible region for an arbitrary convex optimization problem. For 
example, we used the interior point solver Ipopt [33] to minimize (a discretization of) 
the Dirichlet energy of the difference fg — fin order to obtain a function fg that looks 
as similar as possible to the input function / (see Figures 6 and 7). Alternatively, we 
minimized the Dirichlet energy of the simplified function itself in order to obtain smooth 
contour lines. 

6.5 A counterexample for general 2-complexes 

The example of Figure 8 shows that a perfect ^-simplification may not exist on a non- 
manifold 2-dimensional cell complex. For the sake of simplicity, the example is given 
for a non-regular CW complex; it is straightforward to rephrase this example using a 
regular CW complex by subdividing the cells. The complex consists of two 0-cells £ and 
y with /(£) = f(y) = 0, three 1 -cells a, b, and c with f(a) = 1, f(b) = 2, and f(c) = 0, 
and the two 2-cells A and B with f(A) = 2 and f(B) = 3. Note that the complex is not 
manifold since it is not locally euclidean at the 1-cell b. The persistence pairs are (a, A), 
(b, B), and (y, c). To obtain a perfect ^-simplification for 6 = 0.5, one would need to set 
fs(b) = fg(B) = 2.5 and fg(a) = fg(A) = 1.5. The corresponding simplified gradient 
vector field would be Vg = {(a, A), (b, B)}. But since b is a facet of A, we must have 
fgib) < fg(A). Hence, we cannot cancel both (a, A) and (b, B) at the same time. This 
constellation also appears in [10] under the name conflict of type (1,2). 



t (0) 




a (I) 

Figure 8: A discrete Morse function on a 2-complex that does not have a perfect 
^-simplification. The function values of the cells are indicated in brackets. 
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Since such a 2-complex can also appear as a level subcomplex of an n-manifold 
CW complex for n > 3, the example also shows that a perfect ^-simplification does not 
always exists for functions on manifolds. 

6.6 Removing local extrema from functions on manifolds 

As a concluding remark, we want to mention that the same constructions and proofs 
presented in this article can also be adapted to the problem of minimizing the number of 
local extrema of a pseudo-Morse function within a (5-tolerance on any cZ-dimensional 
manifold CW complex. 

Problem (Extrema simplification on manifolds). Given a pseudo-Morse function f on 
a regular manifold CW complex and a real number 6 > 0,find a function fg subject to 
Wfs ~ /lloo < <5 such that fg has a minimum number of local extrema. 

Theorem 25. Given a pseudo-Morse function f on a finite regular closed manifold CW 
complex and a real number 6 > 0, there exists a pseudo-Morse function f s such that 
Wfs ~ /lU < 5 and the number of local extrema of fg equals the number of local extrema 
of f that have persistence > 26. This number is minimal. 

Note that in the case d = 2 this problem is equivalent to the topological simplification 
problem by the following argument. Let c, denote the number of critical cells of 
dimension i. Since the Euler characteristic ^ = cq - c\ + C2 is a topological invariant 
and we have Co + c\ + c^ — 2(cq + C2) ~X, the number of critical points is minimal if 
and only if the number of extrema is minimal. 

Acknowledgements The "Puget Sound" data set used in Figure 7 is taken from the 
Large Geometric Models Archive of the Georgia Insitute of Technology. The original 
elevation data is obtained from The United States Geological Survey (USGS), made 
available by The University of Washington. 
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